Where Should Pitch Accents and Phrase Breaks Go? A Syntax Tree Transducer Solution
نویسندگان
چکیده
Motivated by a desire to assess the prosody of foreign language learners, this study demonstrates the benefit of highlevel syntactic information in automatically deciding where phrase breaks and pitch accents should go in text. The connection between syntax and prosody is well-established, and naturally lends itself to tree-based probabilistic models. With automatically-derived parse trees paired to tree transducer models, we found that categorical prosody tags for unseen text can be determined with significantly higher accuracy than they can with a baseline method that uses n-gram models of part-ofspeech tags. On the Boston University Radio News Corpus, the tree transducer outperformed the baseline by 14% overall for accents, and by 3% overall for breaks. These automatic results fell within this corpus’s range of inter-speaker agreement in assigning accents and breaks to text.
منابع مشابه
Building Prosodic Structures in a Concept-to-Speech System
The prosodic structure of utterances in terms of breaks and tones is a significant problem in speech synthesis. In this work we present the results from models used to predict accurate and realistic prosodic structures within the context of a Concept-to-Speech system for a virtual museum guide. We have used a Natural Language Generator system for providing error-free enriched linguistic informa...
متن کاملWavelets for intonation modeling in HMM speech synthesis
The pitch contour in speech contains information about different linguistic units at several distinct temporal scales. At the finest level, the microprosodic cues are purely segmental in nature, whereas in the coarser time scales, lexical tones, word accents, and phrase accents appear with both linguistic and paralinguistic functions. Consequently, the pitch movements happen on different tempor...
متن کاملModeling Prosodic Structures in Linguistically Enriched Environments
A significant challenge in Text-to-Speech (TtS) synthesis is the formulation of the prosodic structures (phrase breaks, pitch accents, phrase accents and boundary tones) of utterances. The prediction of these elements robustly relies on the accuracy and the quality of error-prone linguistic procedures, such as the identification of the part-of-speech and the syntactic tree. Additional linguisti...
متن کاملPhrase Break Prediction Using a Finite State Transducer
This paper presents a method for phrase break prediction using a finite state transducer. In the literature, several algorithms have been proposed using statistical techniques for predicting phrase breaks. Some of these methods rely on linguistic information, such as syllables, words, part-of-speech, accents, etc. Our proposal is a probabilistic finite state transducer to convert part-ofspeech ...
متن کاملPositional variability of pitch accents in Czech
An analysis of prenuclear accents in read speech is carried out with the aim of finding instances of regularity in their distribution. Significant differences are identified with respect to position within the phrase and phrase length, some of which are correlated with declination and pitch span narrowing. Only a weak interaction is found between nuclear and prenuclear pitch accents. No tendenc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011